
Capstone Project - NLP Chatbot

DOMAIN:

Industrial safety. NLP-based chatbot.

CONTEXT:

The database comes from one of the biggest industries in Brazil and in the world. Industries and companies around the globe urgently need to understand why employees still suffer injuries and accidents in plants, some of which are fatal.

DATA DESCRIPTION:

The database consists of accident records from 12 different plants in 3 different countries. Each row in the data is one occurrence of an accident.

Columns description:

Link to download the dataset: https://drive.google.com/file/d/1_GmrRP1S2OIa02KlfOBNkYa8uxazGbfE/view?usp=sharing

Original dataset link: https://www.kaggle.com/ihmstefanini/industrial-safety-and-health-analytics-database

PROJECT OBJECTIVE:

Design an ML/DL-based chatbot utility that helps professionals identify the safety risk from an incident description.

Replace the folder path with your Drive folder path and clone the GitHub repo.

Run the code below to clone the repo into your Google Drive for the first time.

Change the current folder to the GitHub repo's main folder; all the functions we call must be in this repo.

Run this code to pull the latest repo changes into your Drive.
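
The clone-once, pull-afterwards steps above can be sketched as a small helper. This is a minimal sketch and not the notebook's original cell: the function names `git_sync_command` and `sync_repo` are hypothetical, and the repository URL and Drive path are left for you to supply.

```python
import os
import subprocess


def git_sync_command(repo_url, dest_dir):
    """Return the git command that brings dest_dir up to date.

    If dest_dir already holds a clone (it contains a .git folder), pull the
    latest changes; otherwise clone the repository for the first time.
    """
    if os.path.isdir(os.path.join(dest_dir, ".git")):
        return ["git", "-C", dest_dir, "pull"]
    return ["git", "clone", repo_url, dest_dir]


def sync_repo(repo_url, dest_dir):
    """Run the command chosen by git_sync_command, then enter the repo folder."""
    subprocess.run(git_sync_command(repo_url, dest_dir), check=True)
    # Assumption: the project's helper modules are imported from the repo root.
    os.chdir(dest_dir)
```

In Colab this would typically run after mounting Drive with `from google.colab import drive` and `drive.mount('/content/drive')`, passing your own repo URL and a destination folder under your Drive path to `sync_repo`.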

N-Gram Analysis:
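
As a rough illustration of what an n-gram analysis over incident descriptions involves, the sketch below counts the most frequent word n-grams with the standard library. The helper names `tokenize` and `top_ngrams` are hypothetical, the tokenization rule is an assumption, and the two sample sentences are made up rather than taken from the dataset.

```python
from collections import Counter
import re


def tokenize(text):
    """Lowercase the text and keep runs of letters, digits and apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def top_ngrams(descriptions, n=2, k=10):
    """Count word n-grams over all descriptions and return the k most frequent."""
    counts = Counter()
    for text in descriptions:
        tokens = tokenize(text)
        # Zipping n shifted copies of the token list yields consecutive n-grams.
        counts.update(zip(*(tokens[i:] for i in range(n))))
    return counts.most_common(k)


# Made-up sentences for illustration only; they are not rows from the dataset.
sample = [
    "Employee injured left hand while removing the drill rod",
    "Operator injured left hand when the rod slipped",
]
print(top_ngrams(sample, n=2, k=3))
```

Running the same function with `n=1`, `n=2`, and `n=3` gives unigram, bigram, and trigram frequencies, which can then be compared across accident levels or other columns.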

LSTM:

GloVe Embedding
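
A common way to use pretrained GloVe vectors with an LSTM is to build an embedding matrix whose rows line up with the tokenizer's word indices. The sketch below shows that idea under stated assumptions: the input follows the plain-text GloVe format (a token followed by its space-separated vector components on each line), index 0 is reserved for padding, and the helper names and toy vectors are invented for illustration.

```python
import io


def load_glove(lines):
    """Parse GloVe text format: a token followed by its space-separated floats."""
    vectors = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def build_embedding_matrix(word_index, vectors, dim):
    """Row i holds the vector of the word with index i; unknown words stay zero.

    Row 0 is left as zeros on the assumption that index 0 is the padding token.
    """
    matrix = [[0.0] * dim for _ in range(len(word_index) + 1)]
    for word, idx in word_index.items():
        vec = vectors.get(word)
        if vec is not None and len(vec) == dim:
            matrix[idx] = vec
    return matrix


# Toy 3-dimensional vectors invented for illustration; real GloVe files are far larger.
toy_file = io.StringIO("hand 0.1 0.2 0.3\ninjury -0.4 0.5 0.6\n")
glove = load_glove(toy_file)
matrix = build_embedding_matrix({"hand": 1, "injury": 2, "scaffold": 3}, glove, dim=3)
```

With a real file, `load_glove` would be given an open file handle instead of the `io.StringIO` stand-in, and the resulting matrix would usually be converted to an array and used as the initial weights of the model's embedding layer.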

Base Model

Complex LSTM Model:

Model Predictions:

NLP Augmentation:

Augmenting Data Using BERT Base Uncased

Base Model

Complex Model

Transformer Models

Transformer Models: DistilBERT